Search results for "Genomic Databases"
Functional comparison of bacteria from the human gut and closely related non-gut bacteria reveals the importance of conjugation and a paucity of moti…
2016
The human GI tract is a complex and still poorly understood environment, inhabited by one of the densest microbial communities on earth. The gut microbiota is shaped by millennia of evolution to co-exist with the host in commensal or symbiotic relationships. Members of the gut microbiota perform specific molecular functions important in the human gut environment. This can be illustrated by the presence of a highly expanded repertoire of proteins involved in carbohydrate metabolism, consistent with the large diversity of polysaccharides originating from the diet or from the host itself that can be encountered in this environment. In order to identify other bacterial fun…
Reconfigurable Accelerator for the Word-Matching Stage of BLASTN
2013
BLAST is one of the most popular sequence analysis tools used by molecular biologists. It is designed to efficiently find similar regions between two sequences that have biological significance. However, because the size of genomic databases is growing rapidly, the computation time of BLAST, when performing a complete genomic database search, is continuously increasing. Thus, there is a clear need to accelerate this process. In this paper, we present a new approach for genomic sequence database scanning utilizing reconfigurable field programmable gate array (FPGA)-based hardware. In order to derive an efficient structure for BLASTN, we propose a reconfigurable architecture to accelerate the…
Reactome graph database: Efficient access to complex pathway data
2018
Reactome is a free, open-source, open-data, curated and peer-reviewed knowledgebase of biomolecular pathways. One of its main priorities is to provide easy and efficient access to its high quality curated data. At present, biological pathway databases typically store their contents in relational databases. This limits access efficiency because there are performance issues associated with queries traversing highly interconnected data. The same data in a graph database can be queried more efficiently. Here we present the rationale behind the adoption of a graph database (Neo4j) as well as the new ContentService (REST API) that provides access to these data. The Neo4j graph database and its qu…
Genomic Databases Characteristics
2013
MiasDB: A Database of Molecular Interactions Associated with Alternative Splicing of Human Pre-mRNAs.
2016
Alternative splicing (AS) is pervasive in human multi-exon genes and is a major contributor to expansion of the transcriptome and proteome diversity. The accurate recognition of alternative splice sites is regulated by information contained in networks of protein-protein and protein-RNA interactions. However, the mechanisms leading to splice site selection are not fully understood. Although numerous databases have been built to describe AS, molecular interaction databases associated with AS have only recently emerged. In this study, we present a new database, MiasDB, that provides a description of molecular interactions associated with human AS events. This database covers 938 interactions …
A summary of genomic databases: overview and discussion
2009
In the last few years, both the amount of electronically stored biological data and the number of biological data repositories have grown significantly (today, more than eight hundred such repositories can be counted). In spite of the enormous amount of available resources, a user may be disoriented when he/she searches for specific data. Thus, an accurate analysis of biological data and repositories turns out to be useful to obtain a systematic view of biological database structures, tools and contents and, eventually, to facilitate the access and recovery of such data. In this chapter, we propose an analysis of genomic databases, which are databases of fundamental importance for the research in bioinfo…
Big Data in metagenomics: Apache Spark vs MPI.
2020
The progress of next-generation sequencing has led to the availability of massive data sets used by a wide range of applications in biology and medicine. This has sparked significant interest in using modern Big Data technologies to process this large amount of information in distributed memory clusters of commodity hardware. Several approaches based on solutions such as Apache Hadoop or Apache Spark have been proposed. These solutions allow developers to focus on the problem while the need to deal with low level details, such as data distribution schemes or communication patterns among processing nodes, can be ignored. However, performance and scalability are also of high importance when…
Comparative Mitogenomics of Leeches (Annelida: Clitellata): Genome Conservation and Placobdella-Specific trnD Gene Duplication.
2015
Mitochondrial DNA sequences, often in combination with nuclear markers and morphological data, are frequently used to unravel the phylogenetic relationships, population dynamics and biogeographic histories of a plethora of organisms. The information provided by examining complete mitochondrial genomes also enables investigation of other evolutionary events such as gene rearrangements, gene duplication and gene loss. Despite efforts to generate information to represent most of the currently recognized groups, some taxa are underrepresented in mitochondrial genomic databases. One such group is leeches (Annelida: Hirudinea: Clitellata). Herein, we expand our knowledge concerning leech mitochon…
Identification of factors involved in dimorphism and pathogenicity of Zymoseptoria tritici
2017
A forward genetics approach was applied in order to investigate the molecular basis of morphological transition in the wheat pathogenic fungus Zymoseptoria tritici. Z. tritici is a dimorphic plant pathogen displaying environmentally regulated morphogenetic transition between yeast-like and hyphal growth. Considering the infection mode of Z. tritici, the switch to hyphal growth is essential for pathogenicity, as it allows the fungus to invade the host through natural openings such as stomata. We exploited a previously developed Agrobacterium tumefaciens-mediated transformation (ATMT) to generate a mutant library by insertional mutagenesis including more than 10,000 random mutants. To identify gene…